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Abstract 

We treat here the interrelation between formal languages and those 
dynamical systems that can be described by cellular automata (CA). 
There is a well-known injective map which identifies any CA-invariant 
subshift with a central formal language. However, in the special case of 
a symbolic dynamics, i.e. where the CA is just the shift map, one gets 
a stronger result: the identification map can be extended to a functor 
between the categories of symbolic dynamics and formal languages. 
This functor additionally maps topological conjugacies between sub- 
shifts to empty-string-limited generalized sequential machines between 
languages. If the periodic points form a dense set, a case which arises 
in a commonly used notion of chaotic dynamics, then an even more 
natural map to assign a formal language to a subshift is offered. This 
map extends to a functor, too. The Chomsky hierarchy measuring the 
complexity of formal languages can be transferred via either of these 
functors from formal languages to symbolic dynamics and proves to be 
a conjugacy invariant there. In this way it acquires a dynamical mean- 
ing. After reviewing some results of the complexity of CA-invariant 
subshifts, special attention is given to a new kind of invariant subshift: 
the trapped set, which originates from the theory of chaotic scattering 
and for which one can study complexity transitions. 
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1 Introduction 



This article is organized as follows: Section 2 treats cellular automata and in 
particular the shift map as dynamical systems on subshifts, i.e. closed and 
shift invariant subsets of sequence spaces (cf. @, 0). Besides some standard 
CA-invariant subshifts a new kind (the trapped set) is defined, which origi- 
nates from chaotic scattering theory To this purpose formal multiscattering 
systems (FMS) are introduced and two examples, the truncated double horse- 
shoe and the truncated sawtooth are given (cf. [[Li], |20|). Section 3 is a short 
introduction in formal language theory (cf. |17|, ||, f|). In particular, we define 
a strucure of families of languages called trio whose closure properties have 
a dynamical correspondence. An important example for trios are the levels 
of the Chomsky hierarchy, which offers a means to classify formal languages 
according to their structural complexity. The morphisms which will make 
a link to symbolic dynamics are empty-string-limited generalized sequential 
machines (GSM). They preserve trios. Section 4 addresses this problem of 
how to link the two categories of subshifts and languages (cf. |14|, ^Of). If 
one restricts oneself to shift dynamics then it is possible to construct an in- 
jective functor between these categories. In this case empty-string-limited 
GMS correspond to conjugacies between subshifts. Finally, in section 5, the 
complexity of dynamically generated languages is discussed. Some general 
results (cf. lyj) for CA and some applications in formal multiscattering 
systems (cf.[|19|, |2(J) are reviewed. Another approach to the complexity of 
dynamical systems which uses chaotic limits and factors of finite automata 
instead of finite partitions is presented in [fTEfl . 



2 Dynamical Systems on Subshifts 
2.1 Basic definitions 

We discuss here dynamical systems whose dynamical map operates on se- 
quences. We permit sequences over some finite set A of abstract symbols 
with cardinality ^A > 1 and define the full shift over A as the complete set 
of sequences A z . 

Take the usual topology onA , i.e. the product topology of the discrete 
topology on A. A base for this topology is given by the cylinder sets which 
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consist of all sequences having a common middle segment around the zero 
index: 



Cyl(ai ... a. 



ft. 



) = {s eA Z ; s t 



= a, 



for - [^i] < • < [£]} (1) 



which are both open and closed. This topology is compact (by Tychonov) 
and metrizable by the sequence metric d: 



where a, b gA , 5 xy — if x — y, 1 otherwise. 

Among the continuous maps operating on A z shift maps and cellular 
automata (CA) are of special interest because they are compatible with the 
group structure of the index set Z: 

Denote by a% the (right) shift map onA Z : (er(a)), = a i+ i. Usually we 
write just a for a%. A subshift is a closed cr-invariant set inA^. £ is a finite 
subshift if it is determined by a finite set F of forbidden symbol strings, which 
are not permitted to appear in sequences. We call £ cyclic if the periodic 
(w.r.t. a) sequences are dense. A dynamical system given by s subshift 
£ cA^ and the shift map a\T, — > S is called a symbolic dynamics. A natural 
generalization leads to cellular automata (CA). A (1-dimensional) CA is any 
continuous map r onA^ which commutes with the shift map. Since any CA 
is automatically uniformly continuous an equivalent description of a CA (cf. 
[0> 0) can be given by a local function f T : A 2R+1 — >A with the property 
(r(s))i = f T (si- R , . . . , s i+R ) for any s eA z . 

The appropriate morphisms for topological dynamics are semi-conjugacies. 
We define them here just for shift maps as any continuous map $ between 
subshifts which intertwines between the respective shift maps c% 



If, moreover, $ is a homeomorphism, it is called a conjugacy (between shift 
maps). It may be interpreted as a continuous change of variables which 
identifies the shift maps on and 




(2) 



$ : — » cr<£ o $ = $ o a% 



(3) 
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2.2 Some standard CA-invariant subshifts 

As for general dynamical systems one introduces the forward (and backward) 
limit set, the periodic sets and the non- wandering set. Because of the corn- 
mutability of a CA with the shift map they are also subshifts. Let r be again 
a CA overA^ which need not necessarily be invertible. 

Definition 2.1 • The forward limit set A + (r) is the intersection of all 
forward images: 

oo 

A+(r) = fl r n (A X ) (4) 

n=0 

• The periodic set n(r) is the topological closure of the set of all cycles: 

n(r) = Per(r,A z ) (5) 

• The non-wandering set Q(r): 

Q( T ) = {s gA Z ; V neigbourhood of s 3 n G N s.t. r n {U) n U ^ 0} (6) 

Remark 2.2 1. All the sets just introduced are r-invariant subshifts. 

2. The forward limit set is the maximal invariant subshift ofA^. 

3. n(r) C Q(t) c A+(r). 

4. The dynamics on these invariant sets can be very complicated. We 
recall here the phenomenon of chaos, for which we shall assume the 
following (topological) definition 0: 

As invariant set we take a r-invariant subshift S cA^. Following || 
we call r|S chaotic if 

• (i) r depends sensitively on initial conditions. 

(3 6 > OVs G £Vneighb.t/ s C £3t G U 8 , n G N s.t. d(r n (s), r n (t)) > 

• (ii) r has a dense orbit. 

• (hi) S is r-cyclic, i.e. n(r|E) = S. 

In |l| it is shown (in the general context of dynamical systems on metric 
spaces) that condition (i) can be replaced by requiring the invariant set to 
be infinite. 
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2.3 Another CA-invariant subshift: the trapped set 

We are going to introduce now a generalization of the usually studied limit 
sets of a CA: 

Definition 2.3 Let T be a closed subset of the compact sequence spaceA z 
and CA. Furthermore, we require that either r be the shift 

map or T be a subshift. We define as trapped set A(r, T) the T-maximal 
subshift 

E:=A(r,r)= fV(r) (7) 

ieZ 

This definition is motivated by scattering systems (cf. next subsection). 
The selection principle by which certain sequences inA^ and consequently 
(because of compactness) certain segments are forbidden in X is often called 
pruning. 

2.3.1 Trapping in Formal Multiscattering Systems (FMS) 

Definition 2.4 Let M be a metrizable space and T : M — > M be a bijective 
map. Furthermore, let Q C M be a compact subset and define the following 
images: 

P := T(Q) n Q, % E Z (8) 

The pair (T, Q) is called a formal multiscattering system (FMS) if the fol- 
lowing conditions are satisfied: 

(i) homeomorphism property: The restriction T\Q — > T(Q) is a home- 
omorphism. 

(ii) escape property: Vn G N : T n+1 (/- n ) \ Q ^ 0. 

(iii) trapping property: Vn e N : T n+1 (/- n ) n Q ^ 0. 

(iv) no-return property: if x <E Q, T(x) £ Q then Vn G N : T n (x) G" Q. 

Remark 2.5 1. The set Q may be interpreted as the interaction set of 
some finite range potential. The dynamics restricted to the complement 
M \ Q is then the comparison dynamics of some scattering problem. 
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2. Properties (iii) - (iv) are also valid for T 1 because of the bijectivity 
of T and the following lemma. 

3. Properties (iii), (iv) are equivalent to Vn G N : I n ^ and T(I n )\Q ^ 

Q. 

Lemma 2.6 With the notation of the preceding definition we have: 

(i) = T{i n ) hq = nr=o 

(a) T- j {P) = r j n r- j , fori>j>o. 

Definition 2.7 We define the trapped set of a FMS (T, Q) as its Q-maximal 
invariant set, i.e. 

A:=A(T,Q):= f) T\Q) (9) 

Lemma 2.8 The trapped set A of a FMS is non-empty and compact. 
Proof 

A ^ f) follows from the trapping property (iii) which implies by lemma ^]6| 
the finite intersection property rft=- n Ti (Q) = WL-n 1 * = I n f] I~ n ^ 0. 
Compactness of Q yields the assertion. 
□ 

Definition 2.9 (i) A FMS (T, Q) is called expansive if T\A is expan- 
sive in the usual sense, i.e. if 3 5 > s.t. {x ^ y implies 3 n G 

Z : d(T n x,T n y) > 5}. 

(ii) A finite open cover a of A(T, Q) is called a generator for the FMS 
(T,Q) if V sequences (A$) G a Z : H^C^i) contains no more than 1 
point in Q. If fl^(A) contains no more than 1 point, a is called a 
weak generator. 

As in the standard situation (cf. ||22|| ) one has 
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Lemma 2.10 The FMS is expansive iff it has a generator iff it has a weak 
generator. 

For an expansive FMS the existence of a symbolic dynamics is shown by 
practically copying the standard proof . 



Theorem 2.11 If (T,Q) is an expansive FMS, then there is a surjective 
semi-conjugacy $ : X — > A(T, Q) from a subshift £ over some finite alphabet 
A. 

If there is a cover of disjoint sets one finds the following strengthening: 

Corollary 2.12 If the FMS (T,Q) has a disjoint generator, then $ can be 
chosen to be a topological conjugacy. In this case the trapped set A(T, Q) is 
totally disconnected. 

Finally we are introducing parameterized families of FMS. 

Definition 2.13 A truncated family of FMS is given by a parameterized 
family of FMS {T K ,Q K ) K( zj over some interval J C M n as parameter set, 
where each Q K is the truncation of a common set Q D Q K by a family of 
level lines given by a cut-off function / : M — ► R and an evaluation function 
e : J -> R + : 

Q K = {u e Q : |/(«)|<e(«)} (10) 
We shall now give a simple example: 



2.3.2 Example 1: The truncated double horseshoe 

Consider the following variant of Smale's piecewise linear horseshoe map: As 
usual, we perform on the unit square Qi = [0, l] 2 first a linear horizontal 
contraction and a vertical expansion by positive factors A < 1 and A -1 > 1 
respectively, followed by a folding, so that the folded parts fall outside Q\. 
But we choose a double folding, whose purpose is to make the horseshoe map 
T = restricted to J -1 := Q 1 H T~ l {Qi) not only linear but to remove 
the reflection contained in Smale's horseshoe map, i.e. 
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To construct a symbolic dynamics of this hyperbolic system we use the par- 
tition of J -1 into two disjoint horizontal strips H and Hi and define a topo- 
logical conjugacyQ 

h : A(T, Qi) -> {0, 1} Z , it i-> (a,) 4eZ with T(x) E H ai (12) 

We introduce the following lexical pseudo-order (not antisymmetric) on the 
full shift {0,1}^: Let a, b G {0,1}^ and define the horizontal line H a = 
n^o^,then 

a < b for the smallest i > s.t. a, 7^ 6j one has a io < 6 io (13) 

This pseudo-order corresponds to the arrangement of the corresponding hor- 
izontal lines: 

(#(a0£o) > % (#(&0£o) ^ a > fe ( 14 ) 

where we denoted by 7r y the projection to the y-component. 

We now interpret Qi as the maximal interaction set, which will be trun- 
cated by the pruning function ir y to yield a 1-parameter family of pruned 
scattering systems with interaction sets 

Qs/max := {ueQi: 7r y {u) < y max } (15) 

In the notation of definition |2.13| the evaluation function e is the identity 
on [0, 1] and the cut-off function / is the projection 7r(y). The correspond- 
ing map T ymax is defined by modifying the horseshoe map T only outside 
the box Qy max to make points drift to infinity without returning whereas 
I Qj/max = ^IQj/max' This yields a 1-parameter formal multiscattering 
system (T VmaK , Qy ma Jy mBx e(o,i]- Each time a test particle "jumps" out of the 
box Q Vmax it escapes for good. The trapped set we are interested in is the 
Q ymax -maximal invariant set 

A(T, max , Q yin J = {ue Qi; 7r y (Ti(u)) < y max Vz G Z} (16) 

It consists of all trajectories which never jump out of the box Q ymax - 

Lowering ?/ max will make it easier to get out of the box Q ymax . Formerly 
trapped trajectories manage to escape, so that the trapped set is bound to 
shrink. Actually, we are going to describe the evolution of the trapped set 

1 i.e. h is a homeomorphism and h o T = a o h. 
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in the symbolic dynamics denned above, where the variation of the selection 
condition y < y max will lead to pruning rules determining which symbol 
sequences correspond to trapped trajectories. 

In this model truncation is trivially transferred to symbolic dynamics 
where the trapped set equivalent to A(T ymax , Q Vmas .) is the subshift 

K := {s G {0, 1} Z ; V/ G Z : < v] (17) 

where v is the binary expansion of y max - 

2.3.3 Example 2: The truncated sawtooth 

This example originates from a physical model of the scattering of a point 
particle in an infinite array of non-overlapping elastic scatterers which are 
placed at unit distance from each other along the y-axis (cf. [|8|, ^l]]). The 
symbolic dynamics of this model is a family of FM S over the alphabet A = 
{ — 1, 0, 1} with interaction set 

r„ = {seA z ; \f(s)\<v}, ,eS N », h = {-2,-1,0,1,2} 

f :A Z - & N o, s _> So ( Sl + s _ x ) . . . ( Si + s _.) . . . (18) 
and lexical ordering on £ N o. 

3 Formal Languages 

This section introduced the category of formal languages and some subcate- 
gories. It is based on [|17], [|. 

3.1 Basic definitions 

We start from a finite setA of abstract symbols, which will be called alphabet 
in the following. Define the free semi-group of words or (finite) stringsA* = 
\J^=oA n , which is called the total language over A The semi-group operation is 
the concatenation. Its unit element is the empty string $. Powers correspond 
to repetitions of symbols, the length lg(s) of a string s eA* yields a formal 
logarithm. The total language minus the empty string is written asA + . Any 
subset L of the total language A* is called a language. A word x G L is 
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a subword or segment of a word y G L, written x -< y if there are a, 6 G 
A* s.t. y = axb. A homomorphism h between total languages is understood 
with respect to the semi-group structures. If, moreover, = {$}, 

then h is called %-free. When we speak of an inverse homomorphism h~ x we 
mean the set valued function of preimages of singletons under h. 

3.2 Grammars Generating Formal Languages 

In this section we shall discuss how to generate a formal language by a 
grammar. This concept is based on the notion of a rewriting system. 

Rewriting systems A rewriting system RW = (A, P) is given by an al- 
phabet A and a finite set P cA* xA*. The pairs (S,T) G P are referred to 
as rewriting rules or productions and written as S — > T. 

Assume S — > T, T — > U G P. We are allowed to apply these productions 
also to subwords within a word and using the notation u ^rw" or in short 
to generate a new word directly, i.e. in 1 step: R1SR2 =>■ R\TR 2 
or in finitely manyQsteps: RiSR 2 => R\TR 2 => RiUR 2 which we write as 
RiSR 2 =£* R\UR<2- 

Definition 3.1 A generative grammar G = (A.Tj[y,X , P) is now a rewrit- 
ing system (A, P) where the alphabet A = At OAy is partitioned into ter- 
minals and variables or nonterminals. The nonterminal alphabet Ay con- 
tains a distinguished letter, namely the initial letter X . In any production 
S^TeP the word S being "processed" must contain at least one variable, 
i.e. S £A* T . 

When we use G to generate its associated language Lq we start with the 
initial letter X Q , and do not stop in the generating process before we have 
produced a terminal word inA^: 

L G = {SeA* T : X ^S} (19) 
2 The relation =>■ is the reflexive transitive closure of the binary relation =4> i.e. =>:= 

UZo(^Y wherc (=>)° -{M; o€A}. 
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Chomsky hierarchy Grammars are classified by imposing restrictions on 
the form of productions. The most common classification is the Chomsky 
hierarchy: 

Definition 3.2 A grammar G = (A Tl A v , X , P), A = A v U A N , is of the 
type i if the following restrictions (i) are satisfied for the rewriting rules 

(x,YeA v y. 

(0) no restrictions; 

(1) the rewriting rules "depend on the context", i.e. they have all the 
following form: QXR — > QAR, where Q, R, A e A* and A is not the 
empty string $ with the only possible exception: if X — > $ appears as a 
production, then Xq must not occur on the right side of any production. 

(2) All rewriting rules have the form X — ■> A, where A gA*; 

(3) the rewriting rules concatenate only on one side or replace by a terminal 
S eAt, i.e. their form is: X — > SY or X — > S. 

Type 1 grammars restrict type grammars by requiring that their produc- 
tions (with the possible exception of X — > $) are not length decreasing^ 
(the length of a word is the number of its primitive (i.e. either terminal or 
nonterminable) symbols). 

A language is of type i, i G {0, 1, 2, 3}, if it can be generated by a grammar 
of type i. The common names for type % languages are recursively enumerable, 
context sensitive, context free and regular, respectively. We denote the set of 
type i languages by Lj. This classification is properly nested, i.e. L i+1 C Lj. 
A class properly between type and type 1 languages (type 1/2) are the 
recursive languages. A language L is recursive if both L and its complement 
A*\L are recursively enumerable. The Chomsky complexity x{L) of a language 
L is defined as (— l)x the type i of its simplest generating grammar. 

Certain subfamilies of regular languages are of interest, too, such as finite 
complement languages, where a finite set of words F C A* are forbidden 
segments. They correspond to finite subshifts. 

In fact, besides these language types many other types have been studied. 
Many of them have common properties. We shall define just one underlying 

3 This is even an equivalent characterization of type 1 grammars. 
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structure, called trio structure. For this we have to introduce families of 
languages. 

Definition 3.3 Let 7 be an infinite alphabet and L a set of languages over 

o * 

7, i.e. L e 2 The pair (7, L) is called a family of languages if 

(i) V7 G L 3A C 7 s.t. A is finite and L cA*. 

(ii) 7^0 for some 7 e L. 

A trio is now a family of languages with certain closure properties: 

Definition 3.4 A (full) trio is a family of languages closed under $-free 
(arbitrary) homomorphisms, inverse homomorphisms, and intersections with 
regular languages. 

Example 3.5 The families of regular, context free, and recursively enumer- 
able languages are full trios. The families of context sensitive and recursive 
languages are trios that are not full. 



3.3 Acceptors and Morphisms of Languages 

Acceptors Automata are mathematical models of devices that process in- 
formation by giving responses to inputs. Formal language theory views them 
as scanning devices or acceptors able to recognize, whether a word belongs 
to a given formal language. A hierarchy of automata corresponds to the 
Chomsky hierarchy of languages, in the sense that any class i language is 
recognized by a class i automaton and conversely any class i automaton pro- 
duces a class i language as output. The corresponding automata are called 
Turing machines (0) linear bounded automata (1), pushdown automata (2) 
and finite deterministic automaton (3). We refer for details to the literature 
(e.g.[0, 



Morphisms An intrinsic justification of a classification scheme are closure 
properties of its classes under naturally associated morphisms. We choose 
the maps induced by $-limited generalized sequential machines (GSM). They 
are generalizations of homomorphisms. Their special importance for our 
discussion lies in the fact that they correspond to the morphisms of subshifts, 
i.e. their semi-conjugacies (cf. section fl.3| ). 
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Definition 3.6 A generalized sequential machine (GSM) is given by the six- 
tuple M = (ZyAjlS, u, t, zq), where Z (state set), A (input alphabet), and S 
(output alphabet) are finite sets, r : Z xA — > Z (next state map) and 
oj : Z xA — > S* (output map) are maps. M is called $-free if the empty 
string $ is not an assumed value of u. We extend the output map uj and 
the next state map r to Z xA* by t(z, $) := z, uj(z, $) := $, and for s gA*, 
a gA recursively r(z,sa) := t(t(z,s),o), uj(z,sa) := u>(z,s)u(r(z,s),a) by 
concatenation. 
The map 

M:A*^y*, s^u(z ,s) (20) 

is called a GSM map. A GSM M is called %-limited on a given language 
L cA* if there is a k > s.t. Vs G L : if s = rcyz and cj(t(,20) ^O; 2/) = $ f° r 
some x, y, z gA* then lg(j/) < k. 

An important property of $-limited GSM maps is that they keep trios and 
hence the Chomsky classes invariant if one neglects the empty string (cf. ||). 

Lemma 3.7 For each trio L, each L G L ; and each GSM map M %-limited 
on L we have: 

(i) M(L) \ {$} G L 

(ii) If h is a full trio or if L is closed under union with $ or if a G 
L, M(a) = $ implies a = $, then even M(L) G L. 

(Hi) M~\L) G L. 

Corollary 3.8 Let £ be the language category with languages as objects and 
%-limited GSM maps as morphisms. Any full trio forms a subcategory. 

4 Subshifts and Languages — 

how they fit together 

In this section we are going to study how topological dynamics can be linked 
with formal language theory. There are two approaches. In the first, which 
we will not treat further, one extends formal language theory to encompass 
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bi-infinite words. In the second, one establishes a bijective correspondence 
between subshifts and so-called central languages. We are going to follow 



two tracks. The first, introduced in [13 1, constructs a bijection between 
the set of all subshifts and the set of all central languages, which will be 
defined presently. In the second, we restrict our attention to chaotic symbolic 
dynamics, which we required in particular to be cyclic (cf. section |2.2| ), and 
identify those languages which correspond naturally to a cyclic subshift (cf. 



4.1 Dynamically generated languages 

First we define the languages we associate to general and cyclic subshifts, 
respectively. These are the associated central language which consists of all 
segments of permitted sequences and the associated cyclic language, which 
consists of all full periods of a cycle, or more formally: 
Define the periodization map 

( A : A* -f Per(a4 Z ), s ^ s (21) 

where s t := s, mod i g(s) . 

Definition 4.1 Let £,£' C be an arbitrary and a cyclic subshift (i.e. 
II(cr, S) = £), respectively. Then 

(i) £(£) = {a G A*; 3s G S : a -< s} is called the associated central 
language and 

(ii) £ c (£') = C^ 1 (Per(a, £')) U {$} is called the associated cyclic language. 



Observe that C C (T!) contains together with any word w all its repetitions w\ 
i > and all their cyclic permutations. An easy exercise shows the following 

Lemma 4.2 The maps C and C c are injective. 

Therefore different subshifts can still be distinguished on the level of their 
associated languages. 
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4.2 Adherences and centers 



A map in the opposite direction, i.e. from the set of all languages to the set 
of all subshifts, is the adherence. It maps a language onto the subshift of all 
sequences whose segments are also segments of words of the language: 

Definition 4.3 Let L cA*. 

Adh(L) = {s eA Z ; Va -< s3w G L s.t. a -< w} (22) 
is called the adherence of the language L. 

Obviously, Adh(L) is closed and shift invariant and hence a subshift. The 
language operators C and C c are in fact inverses of the properly restricted 
adherence operator Adh. The right restrictions can be characterized as in- 
variants of the center operators C, C c : 

Definition 4.4 We define the following two operations on the set of lan- 
guages over A: 

(i) The center C(L) of a language L is the language 

C(L) = {a gA*; ViV > 3x, y gA*, \g{x),\g{y) > N s.t. xay G L} 

(23) 

A string in C(L) is sometimes called bi-extensible in L. 

(ii) The cyclic center C C (L) is 

C C {L) = {a eA*; Vi > OV permutations 7r(a l ) 3x, y &A* s.t. X7r(a l )y G L} 

(24) 

We call a language L central if C(L) = L and cyclic if C C (L) = L. One 
sees from this definition immediately that the associated central language 
of a subshift is central and that the associated cyclic language of a cyclic 
subshift is cyclic. Conversely, by the following theorem linking subshifts 
and languages, one sees that any central language is the associated central 
language of some subshift, and likewise any cyclic language is the associated 
cyclic language of some cyclic subshift. 

We now state the announced theorem, whose first part (i) is due to [14 

Theorem 4.5 Let L be a language over A. 
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(i) IfT, is any subshift over A, then 



Adh(£(E)) = E (25) 
£(Adh(L)) = C{L) (26) 



(ii) IfT,' is a cyclic subshift over A, then 

Adh(£ c (E')) = E c (27) 
£ c (Adh(L)) = C C (L) (28) 

Proof 

ad (i) eq. By applying the definitions one sees that £(Adh(£(E))) = 
£(S). The injectivity of £ implies the first assertion. 

For eq. |26] let first a G £(Adh(L)), i.e. 3 s G Adh(L) s.t. a = (ai . . .a n ) = 
(si . . . s n ) -< s. Let N > and 6 := Si_at . . . s n+ iy. We have b G £(Adh(L)) 
and a -< b -< s. By the definition of the adherence 3 c G L s.t. a ~< b ~< c. 
Hence a G C(L). Conversely, let a G C(L). Then a must have an infinite 
number of bi-extensions in L. This implies the existence of a nested sequence 
of words to W g C(L), i > 0, = a) with strictly increasing length 

lg(iyW) — > oo. By compactness ofA z the set W = f|^o Cy\(w^) is nonempty, 
hence a singleton {-u/ 00 -*} which by construction is contained in Adh(L). As 
a ~< w ioo \ we get a G £(Adh(L)). 



ad (ii) eq.|7|"c": Let s G Adh(£ c (S')). Then Va -< s3w G £ C (SQ s.t. a -< 



to. But to G £ C (S') means that w G E'. As E' is cyclic (Per(<r, E') = E'), 
we conclude s G E'. "d": As the adherence of a language is a subshift, it 
is enough to show Per(<r, E') C Adh(L c )(E'). So let s G Per(cr, E'). Then 
3a gA* s.t. s = a and Vz > 1 : a 1 and all its permutations are in £ C (E'). 
Therefore s = a G Adh(£ c (E')). 

ad (ii) eq|§ "C": Let a G £ c (Adh(L)). Then a G Adh(L), i.e. Vc -< 
a3x,y gA* s.t. xcy G L. In particular Vi > IV permutations n(a l ) 3x,y E 
A* s.t. xTT{a l )y G L. Hence a G C C (L). "d": Let a G C C (L). Then Vi > 
13x,y gA* s.t. xa*j/ G L. Hence a G Adh(L), so that a G £ c (Adh(L)). 
□ 



Corollary 4.6 (i) The pair of maps (£,Adh) give a bijective correspon- 
dence between central languages and subshifts over A. 
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(ii ) The pair (L c , Adh) gives a bijective correspondence between cyclic lan- 
guages and cyclic subshifts over A. 

4.3 Functors from the category of languages to the cat- 
egory of subshifts 

Let (53) be the category of symbolic dynamics, i.e. its objects are the sub- 
shifts and its morphisms the semi-conjugacies of shift maps. The subcategory 
of cyclic subshifts will be denoted by GD C . Let £ be the category of lan- 
guages with languages as objects and $-limited GSM maps as morphisms. 
We shall now extend the language operators C and C c to yield functors. 

The key observation is that analogously to CA, (cf. |7|, §]) there is a 
description of semi-conjugacies by local functions. 

Let $ : £^ — > be a semi-conjugacy, first between arbitrary subshifts. 
Since £^ is compact $ is even uniformly continuous. Therefore, for each 
/ G No there exists a minimal odd number N(l) > 0, s.t. for the discrepancy 
is of two sequences s,t (i.e. half the length of the longest common middle 
segment), we find 

is(s,t) > N(l) i 5 ms),$(t)) > I (29) 

Since $ commutes with the shift map, it bears a close resemblance with a 
cellular automaton (CA) but for the possibility to change the sequence space. 
Like a CA, $ is determined by a local function albeit defined only on a subset 
D cA 2R+ \ 2R+ 1 := N(l): 

4>: D 4>(si_ R . . .Si. . .s i+R ) = {§{s))i 

D ={a eA 2R+1 ; 3s E S A s.t. a -< s} (30) 

We call R the radius of uniform continuity of $. 

Conversely, let r e N, D cA 2r+1 , (p : D ^ Denoting by S(F) cA Z 
the finite subshift generated by the forbidden set F = A 2r+1 \ D we get a 
semi-conjugacy $ : — ► S<^, ($(s))j = ^(sj_ r . . . . . . s i+r ) onto some 

finite subshift C £? Z . Thus, we have shown 

Lemma 4.7 There is a bijective correspondence between semi-conjugacies of 
finite subshifts and local functions. 
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This local representation of <3> can be used to prove the following 



Lemma 4.8 Each semi-conjugacy between subshifts can be simulated by a 
%-limited GSM mapping both on the associated central languages and on the 
associated cyclic languages (if the subshifts are cyclic). 

Proof 

Let $ : — > be a semi-conjugacy between two subshifts cA z and 

CB Z . Let (f) : A 2R+1 D D -> B, R > 0, be the associated local function 

of eq. For convenience, extend <fi in some arbitrary way to the whole 

^2R+1 

(i) First we examine the central language. This part is an adaptation of a 



proof given by Culik II et al. [14] for cellular automata. Define a map 



£($) : £(S a ) - £(£„), { Sl )U » { 



(si . . . 8 1+2 r) • • • (f)(s n - 2 R • • • s n ) \in>2R 

otherwise 
(31) 

Observe that a string of length n > 2R is mapped by £($) to a string of 

length n - 2/2. Obviously £($(£*)) = £($)(£(£ A ))- 

Next one defines a GSM, whose associated GSM map will be just £($): 

Let M = (Z,A,B, u>, r, z 0) 2"/) with state set Z = Ui=o^\ initial state z — $> 

next state function r and output function cj defined for z G Z, a G A, as 

follows: 

// \ _ / (^)Li a if r < 2i? (( y , _ r $ if r < 2i? 

M is a $-limited GSM on £(£^) because only strings shorter than 2R are 
mapped to the empty string $. Obviously, the GSM map M|£(£^) = £($). 
(ii) For the cyclic language we proceed similarly. Observe, that we identified 
repetitions of cycles s\ i > 0, s G £ c (£). We define a length conserving 
£ c ($) and periodize s = (si)?=i 7^ $ by s n+i := (((s)) n+i = sf 



(Sl . . . Si +2 r) ■ ■ ■ (f)(s n . . . S n+2 R) if (s»)fLi ^ 

otherwise 
(33) 

Again we get £ c ($(£^)) = £ c ($)(£ c (£^)). Next we concatenate to each s G 
A* an end marker •, i.e. we identify s with s«. The $-limited GSM simulating 
the semi-conjugacy $ is defined by Z = \J^ A l , z = $, r(z ,a) := [a; a], 
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h, t gA*. The string h is going to store the leading {2R or up to 4R 2 ) symbols 
of an input word s« (see cases 1,2 in the eq. below): 

l(h t ) r i=1 a; (ti) r i=1 a\ if r < 2R, a ^ • 

[((^)Li) 2R ;((^)Li) 2fi ] Hr<2R,a = . 
[(hi) 2 A;(ti)^ 2 a] Xr = 2R, a ± . 

[{hi)U\{U)U\ ifr>2R, a = . 

f $ if r < 2i? 
.«) = M^** ) ifr = 2i?, a^. (34) 
[ 4>(ti . . . t r hi) . . . <p(t r hi . . . h 2 R) if a = • 

where </>(t) := 0(ti . . . t 2R+1 ) if lg(t) > 2R. Again, we get M|£ C (£ A ) = £ c ($). 



Corollary 4.9 Definition |^.i| together with equations ond defines two 
functors £ : 62) — > £ and C c : 6D C — > £. 



The lemmas 3.7 and H]q yield the announced closure properties: taking fac- 



tors does not increase complexity^. 

Theorem 4.10 Any semi- con jug acy $ between two subshifts preserves any 
full trio the associated (central or cyclic) language belongs to; in particular, 
if £ is an arbitrary subshift, £' a cyclic subshift then: 



X (£ c ($(£'))) < xOCc(EO) 
X (£(*&))) < X(m) (35) 



where L := L \ {$} for any language L. If furthermore, $ is a conjugacy 
then the inequalities just stated become equalities. 

5 Complexity of dynam. generated languages 

In this section we first return to general CA. 

4 Compare this to the analogous statement about topological entropy in jl5| . 
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5.1 Sofic Systems 

The simplest way to generate a language dynamically is by applying some 
CA a finite number of times to a full shift. As it happens these so-called sofic 
systems yield just the regular languages: 

Definition 5.1 Let t be a cellular automaton (CA) over the shift A^ and 
Em a subshift of finite type. The subshift t(Em) is called a sofic system. 

One has (cf. ||10|| ) the following: 

Proposition 5.2 A subshift E is sofic iff £(E) is regular. 



5.2 Some results for CA and their "standard" invari- 
ant sets 



We quote here some results from |fi3|| . First one observes the following con- 
straint for limit languages: 

Theorem 5.3 For every cellular automaton r : A^ 1 -^-X^ the complement 
of the forward limit language A* \ £(A + (r)) is recursively enumerable. 

Apart from this constraint any language is realisable by the limit set of a CA 
in the following sense: 

Theorem 5.4 If L C A* has a recursively enumerable complement, then 
there exists a CA r : tf^ — > a regular language R C B-* , and a homo- 
morphism h : B-* ^A* s.t. : 

/i(£(A+(r)) n R) = L (36) 

In particular examples exist, for which the the central language of the forward 
limit set is not recursively enumerable. 

For the periodic set one has the following results: 

Theorem 5.5 (%) If r is any CA, then £(IT(t)) is recursively enumerable. 

(ii) If L cA* is recursively enumerable, then there exists a CA r : — > 

ifi , a regular language R C R-* , and a homomorphism h : B-* — ► 
A* s.t. ; ' 

h(C(U(r)) n R) = L (37) 
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Theorems |5]3] and |5.5| (ii) describe only the complexity of a sublanguage 
of a limit language. They do not yield better lower limits that theorems 
and |5.5| (i). Examples of limit languages with any of the introduced 

Instead, we shall treat in 



12, 11 



Chomsky complexities can be found in 
the next section examples of families of FMS which illustrate how a variation 
in complexity accompanies a transition to chaos of the associated dynamical 
system. 



5.3 Results for some simple truncated families of FMS 



For this subsection cf. |19], Here we treat the examples of subsections 
2.3.2j and 2.3.3 . In these models one is interested in complexity transitions 
arising by variation of the family parameter and accompanying transitions 
to chaos of these models. For other complexity transitions cf. 0. 

Truncated horseshoe The first observation one makes is, that the pa- 
rameterization map 

{0,1} N <» = [0,2] -> 2 AZ 

v ^ £„ := {s G {0,1} Z ; V/ G Z : < u} (38) 

is locally constant everywhere but on the set 

V = {a G {0, 1} N ; ViGZ: a\a) < a} (39) 

which is called the bifurcation set of the family of FMS. One observes the 
following complexity transition: 

Theorem 5.6 Suppose v G Vfl {1,2}. Then both central language £(£„) 
and cyclic language £ C (E V ) are regular iff p G Q. 



Truncated sawtooth This system becomes chaotic at the critical value 
^crit = 1 — 10. For v < u crit the trapped set is finite, hence the associated 
languages regular. At the critical value one finds: 

Theorem 5.7 The associated languages £(T, U i ) and £<.(£„ . ) are context 
sensitive but not context free. 
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